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of its optimal solution is an algebraic number. We study the degree of the minimal polynomials of these 
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1. Introduction 

A fundamental question about any optimization problem is how the output depends on the 
input. The set of optimal solutions and the optimal value of the problem are functions of the 
parameters, and it is important to understand the nature of these functions. For instance, 
for a linear programming problem, 

maximize c • x subject to A - x — h and x > 0, (1-1) 

the optimal value is convex and piecewise linear in the cost vector c and the right hand side b, 
and it is a piecewise rational function of the entries of the matrix A. The area of mathematics 
which studies these functions is geometric combinatorics, specifically the theory of matroids 
for the dependence on A, and the theory of polyhedral subdivisions [5 for the dependence 
on b and c. 

For a second example, consider the following basic question in game theory: 

Given a game, compute its Nash equilibria. (1-2) 

If there are only two players and one is interested in fully mixed Nash equilibria then this 
is a linear problem, and in fact closely related to linear programming. On the other hand, 
if the number of players is more than two then the problem ()1.2p is universal in the sense 
of real algebraic geometry: Datta |4j showed that every real algebraic variety is isomorphic 
to the set of Nash equilibria of some three-person game. A corollary of her construction is 
that, if the Nash equilibria are discrete, then their coordinates can be arbitrary algebraic 
functions of the given input data (the payoff values which specify the game). 
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Our third example concerns maximum likelihood estimation in statistical models for 
discrete data. Here the optimization problem is as follows: 

Maximize pi(6l)"ip2(6')"' • • •Pn(6')"" subject to 6* 6 6», (1.3) 

where O is an open subset of M™, the Pi{9) arc polynomial functions that sum to one, and the 
Ui are positive integers (these are the data). The optimal solution 9^ which is the maximum 
likelihood estimator, depends on the data: 

(wi, . . . , w„) ^ 0{ui, . . . , w„). (1.4) 

This is an algebraic function, and recent work in algebraic statistics [3lfT2] has led to a formula 
for the degree of this algebraic function, under certain hypothesis on the polynomials pi (0) 
which specify the statistical model. 

The aim of the present paper is to conduct a similar algebraic analysis for the optimal 
value function in semidefinite programming. This function shares some key features with each 
of the previous examples. To begin with, it is a convex function which is piecewise algebraic. 
However, unlike for (|l.ip . the pieces are non-linear, so there is a notion of algebraic degree 
as for Nash equilibria (|1.2p and maximum likelihood estimation (|1.3p . However, semidefinite 
programming does not exhibit universality as in [4] because the structure of real symmetric 
matrices imposes some serious constraints on the underlying algebraic geometry. It is these 
constraints we wish to explore and characterize. 

We consider the semidefinite programming (SDP) problem in the form 

maximize trace(-B • F) subject to Y and Y ^ 0. (1-5) 

where B is a real symmetric n x n-matrix, U is a m-dimensional affine subspace in the ("^^)- 
dimensional space of real n x n-symmetric matrices, and Y ^ means that Y is positive 
semidefinite (all n eigenvalues of Y are non-negative). The problem (jl.Sp is feasible if and 
only if the subspace U intersects the cone of positive semidefinite matrices. In the range 
where it exists and is unique, the optimal solution Y of this problem is a piecewise algebraic 
function of the matrix B and the subspace U. Our aim is to understand the geometry of 
this function. 

Note if U consists of diagonal matrices only, then we are in the linear programming case 
(jl.ip . and the algebraic degree of the pieces of Y is just one. What we are interested in is 
the increase in algebraic complexity which arises from the passage from diagonal matrices 
to non-diagonal symmetric matrices. 

Example 1 (Elliptic Vinnikov curves) Let n = 3 and m = 2, so Y runs over a two- 
dimensional affine space of symmetric 3 x i-matrices. Then Y >Q specifies a closed convex 
semi- algebraic region in this plane, whose boundary is the central connected component of 
a cubic curve as depicted in Figure [H This curve is a Vinnikov curve, which means that it 
satisfies the following constraint in real algebraic geometry: any line which meets the interior 
of the convex region intersects this cubic curve in three real points. See \llVil5^ 0. 211 for 
details. However, there are no constraints on Vinnikov curves in the setting of complex 
algebraic geometry. The Vinnikov constraints involve inequalities and no equations. This 
explains why the curve in Figure [7] is smooth. 

Our problem lll.5\) is to maximize a linear function over the convex component. Alge- 
braically, the restriction of a linear function to the cubic curve has six critical points, two 
of which are complex. They correspond to the intersection points of the dual curve with a 
line in the dual projective plane. The degree six curve dual to the elliptic Vinnikov curve is 
depicted in Figure\M 
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Fig. 1. The convex component in the center of this elliptic Vinnikov curve is the feasible region for SDP 
with m = 2,n = 3. 




Fig. 2. The dual to the elliptic Vinnikov curve in Figure [T] is a plane curve of degree six with three real 
singular points. 

Our analysis shows that the algebraic degree of SDP equals six when m ~ 2 and n = 3. 
// the matrix B and the plane lA are defined over Q then the coordinates of the optimal 
solution Y are algebraic numbers of degree six. By Galois theory, the solution Y cannot in 
general be expressed in terms of radicals. For any specific numerical instance we can use the 
command "galois" in maple to compute the Galois group, which is then typically found to 
be the symmetric group Sq. □ 
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Fig. 3. The convex component in the center of this Cayley cubic surface is the feasible region for SDP with 
m = n = 3. 

Example 2 (The Cayley Cubic) Now, suppose that m — n — 3. Then dct{Y) — is a 
cubic surface, but this surface is constrained in the context of complex algebraic geometry 
because it cannot be smooth. The cubic surface det(y) = has four isolated nodal singular- 
ities, namely, the points where X has rank one. This cubic surface is known to geometers 
as the Cayley cubic. In the optimization literature, it occurs under the names elliptope or 
symmetroid. Optimization experts are familiar with (the convex component of) the Cay- 
ley cubic surface from (the upper left hand picture in) Christoph Helmberg's SDP web page 
|http : //www -user . tu-chemnitz . de/ ^helmberg/ semidef . html[ 

The surface dual to the Cayley cubic is a surface of degree four, which is known as the 
Steiner surface. There are now two possibilities for the optimal solution Y of il.5]) . Either Y 
has rank one, in which case it is one of the four singular points of the cubic surface in Figure 
or y has rank two and is gotten by intersecting the Steiner surface by a line specified by 
B. In either of these two cases, the optimal solution Y is an algebraic function of degree 
four in the data specifying B and hi. In particular, using Girolamo Cardano's Ars Magna, 
we can express the coordinates ofY in terms of radicals in (B,U). □ 

The objective of this paper is to study the geometric figures shown in Figures 1, 2 and 
3 for arbitrary values of n and m. The targeted audience includes both algebraic geometers 
and scholars in optimization. Our presentation is organized as follows. In Section 2 we review 
SDP duality, we give an elementary introduction to the notion of algebraic degree, and we 
explain what it means for the data U and B to be generic. In Section 3 we derive Pataki's 
inequalities which characterize the possible ranks of the optimal matrix Y, and, in Theorem 
[71 we present a precise characterization of the algebraic degree. The resulting geometric 
formulation of semidefinite programming is our point of departure in Section 4. Theorem [TO] 
expresses the algebraic degree of SDP as a certain bidegree. This is a notion of degree for 
subvarieties of products of projective spaces, and is an instance of the general definition of 
multidegree in Section 8.5 of the text book [T7]. Theorem [TT] gives explicit formulas for the 
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algebraic degree, organized according to the rows of Table [21 In Section 5 we present results 
involving projective duality and determinantal varieties, in Section 6 this is combined with 
results of Pragacz [19] to prove Theorem [TTJ and to derive the general formula stated in 
Theorem [T^ and Conjecture HU 

Two decades ago, the concept of algebraic degree of an optimization problem had been 
explored in the computational geometry literature, notably in the work of Bajaj |2j . However, 
that line of research had only few applications, possibly because of the dearth of precise 
results for geometric problems of interest. Our paper fills this gap, at least for problems 
with semidefinite representation, and it can be read as an invitation to experts in complexity 
theory to take a fresh look at Bajaj 's conclusion that "... the domain of relations between 
the algebraic degree ... and the complexity of obtaining the solution point of optimization 
problems is an exciting area to explore." [21 page 190]. 

The algebraic degree of semidefinite programming addresses the computational complex- 
ity at a fundamental level. To solve the semidefinite programming exactly essentially reduces 
to solve a class of univariate polynomial equations whose degrees are the algebraic degree. 
As we will see later in this paper, the algebraic degree is usually very big, even for some small 
problems. An explicit general formula for the algebraic degree was given by von Bothmer 
and Ranestad in the paper [25j which was written subsequently to this article. 



2. Semidefinite programming: duality and symbolic solutions 

In this section we review the duality theory of semidefinite programming, and we give an 
elementary introduction to the notion of algebraic degree. 

Let M be the field of real numbers and Q the subfield of rational numbers. We write 
iS" for the ("^^) -dimensional vector space of symmetric n x n- matrices over M, and QiS" 
when we only allow entries in Q. A matrix X 6 S" is positive definite, denoted X )^ 0, if 
u^Xu > for all u G M"\{0}, and it is positive semidefinite, denoted X ^ 0, if u^Xu > 
for all u G M". We consider the semidefinite programming (SDP) problem 

min C»X (2.1) 

xes^ 

s.t. A,»X^b^ fori=l,...,m (2.2) 

and X hO (2.3) 

where b G Q", C, Ai,...,A^e (Q)5". The inner product C • X is defined as 

C»X := tr£ice{C-X) = forC,A:G5". 

This is a linear function in X for fixed C, and that is our objective function. The primal SDP 
problem p.2|) - ()2.3p is called strictly feasible, or the feasible region has an interior point, if 
there exists some X y such that (|2.2p is met. 

Throughout this paper, the words "generic" and "genericity" appear frequently. These 
notions have a precise meaning in algebraic geometry: the data C,b, Ai, . . . , Am are generic 
if their coordinates satisfy no non-zero polynomial equation with coefficients in Q. Any 
statement that is proved under such a genericity hypothesis will be valid for all data that lie 
in a dense, open subset of the space of data, and hence it will hold except on a set of Lebesgue 
measure zero. For a simple illustration consider the quadratic equation at'^ + pt + ^ = 
where t is the variable and a, (3, 7 are certain parameters. This equation has two distinct 
roots if and only if the discriminant a(/3^ — Aaj) is non-zero. The equation a(/3^ — Aaj) — 
defines a surface, which has measure zero in 3-space. The general point {a, (3, 7) does not lie 
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on this surface. So we can say that at^ + /3t + 7 = has two distinct roots when a, /?, 7 are 
generic. 



The convex optimization problem dual to (|2.ip - (|2.2p is as follows: 

max b'^y (2.4) 

m 

s.t. A{y) -.^ C-^y^A^ h 0. (2.5) 

i=l 

Here the decision variables t/i, . . . , ym are real unknown. The condition (|2.5p is also called a 
linear matrix inequality (LMI) . We say that (|2.5p is strictly feasible or has an interior point 
if there exists some y G R™ such that A(?/) ^ 0. 

Our formulation of semidefinite programming in (|1.5p is equivalent to (|2.4p - (|2.5p under 
the following identifications. Take 14 to be the afRne space consisting of all matrices C — 
Vi-^i where y £ R™, write Y — A{y) for an unknown matrix in this space, and fix a 
matrix B such that B • Ai = —bi for i = 1, . . . , to. Such a choice is possible provided the 
matrices Ai are chosen to be linearly independent, and it implies B •¥ — B • C ~ b'^y. 

We refer to [231126] for the theory, algorithms and applications of SDP. The known results 
concerning SDP duality can be summarized as follows. Suppose that X £ 5" is feasible for 
(I221)-(I231) and y G is feasible for (HH). Then C»X-b'^y = A{y) • X > 0, because the 
inner product of any two semidefinite matrices is non-negative. Hence the following weak 
duality always holds: 

sup h^y < inf C • X. 

When equality holds in this inequality, then we say that strong duality holds. 

The cone of positive semidefinite matrices is a self-dual cone. This has the following 
important consequence for any positive semidefinite matrices A{y) and X which represent 
feasible solutions for (|2.2|) - (|2.3p and (|2.5p . The inner product A{y) • X \s zero if and only if 
the matrix product A{y) ■ X is the zero matrix. The optimality conditions are summarized 
in the following theorem: 

Theorem 3 (Section 3 in [23] or Chapter 4 in [26]). Suppose that both the primal 
problem I12.1\) - I[273\) and the dual problem ^2^-(2^ are strictly feasible. Then strong du- 
ality holds, there exists a pair of optimal solutions, and the following optimality conditions 
characterize the pair of optimal solutions: 

A,»X = b, for i = 1,2, . . . ,TO, (2.6) 

Aiy)-X^O, (2.7) 

A{y) h and X hO. (2.8) 



The matrix equation ()2.7p is the complementarity condition. It implies that the sum of the 
ranks of the matrices A(y) and X is at most n. We say that strict complementarity holds if 
the sum of the ranks of A{y) and X equals n. 

Suppose now that the given data C,Ai,..., Am and b are generic over the rational 
numbers Q. In practice, we may choose the entries of these matrices to be random integers, 
and we compute the optimal solutions y and X from these data, using a numerical interior 
point method. Our objective is to learn by means of algebra how y and X depend on 
the input C, Ai, . . . , Am, b. Our approach rests on the optimality conditions in Theorem O 
These take the form of a system of polynomial equations. If we could solve these equations 
using symbolic computation, then this would furnish an exact representation of the optimal 
solution {X,y). We illustrate this approach for a small example. 
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Example 4 Consider the following semidefinite programming problem: 



Maximize yi+y2+ 2/3 

2/3 + 1 2/1 + y2 2/2 2/2 + 2/3 

2/1 + 2/2 -2/1 + 12/2-2/3 2/2 

2/2 2/2 - 2/3 2/2 + 1 2/1+ ?/3 

2/2 + 2/3 2/2 2/1 + 2/3 -2/3 + 1 



subject to A{y) 



>- 0. 



This is an instance of the LMI formulation 1^2. 4\ )- (275]) with m = 3 and n ~ 4. Using the 
numerical software SeDuMi ^2JJ, we easily find the optimal solution: 



iyi,y2,h) = (0.3377..., 0.5724..., 0.3254., 



What we are interested in is to understand the nature of these three numbers. 

To examine this using symbolic computation, we also consider the primal formulation 
i2.1\) - l[27^) . Here the decision variable is a symmetric 4 x A-matrix X = (xij) whose ten 
entries Xij satisfy the three linear constraints 12. 6\} : 



Ai» X = ~2xi2 + X22 - 2a;34 = 1 

A2» X = -2xi2 - 2xi3 - 2x14 - 2a;23 - 2a;24 - 2:33 = 1 
A^* X — xii — 2xi4 + 2.T23 - 20:34 + 0:44 — 1 



In addition, there are sixteen quadratic constraints coming from the complementarity con- 
dition \2. 7[ ), namely we equate each entry of the matrix product A{y) ■ X with zero. Thus 
ll2.6\) - [27?^ translates into a system of 19 linear and quadratic polynomial equations in the 
13 unknowns 2/1, 2/2, 2/3, a;ii, 2:12, . . . ,2:44. 

Using symbolic computation methods (namely, Grobner bases in Macaulay 2 JEI), we 
find that these equations have finitely many complex solutions. The number of solutions is 
26. Indeed, by eliminating variables, we discover that each coordinate yi or Xjk satisfies a 
univariate equation of degree 26. Interestingly, these univariate polynomials are not irre- 
ducible but they factor. For instance, the optimal first coordinate yi satisfies the univariate 
polynomial /(2/1) which factors into a polynomial g{yi) of degree 16 and a polynomial of 
degree hijji) of degree 10. Namely, we have /(2/1) = 5(2/1) " h{yi), where 

5(2/1) = 403538653715069011 yl^ - 24807748649488603047/15 
+ 6231483282173647552 yl"^ - 5986611777955575152 yl^ 
+ + 

+ 59396088648011456 2/1 - 4451473629111296 2/1 + 149571632340416 



and h{yi) ^ 2018 1/1° - 12156 2/? + 17811 2/? +■•■+ 1669 2/1 - 163. 

Both of these polynomials are irreducible in Q[yi]. By plugging in, we see that the optimal 
first coordinate yi = 0.3377 . . . satisfies g{yi) = 0. Hence jji is an algebraic number of degree 
16 over Q. Indeed, each of the other twelve optimal coordinates 2/2, ys, 2:11, ii2, . . . , 2:44 also 
has degree 16 over Q. We conclude that the algebraic degree of this particular SDP problem 
is 16. Note that the optimal matrix A(ij) has rank 3 and the matrix X has rank 1. 

We are now in a position to vary the input data and perform a parametric analysis. For 
instance, if the objective function is changed to 2/1 — 2/2 then the algebraic degree is 10 and 
the ranks of the optimal matrices are both 2. □ 
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The above example is not special. The entries for m = 3, n = 4 in Table [2] below inform 
us that, for generic data {C,b,Ai), the algebraic degree is 16 when the optimal matrix 

Y = A{y) has rank three, the algebraic degree is 10 when the optimal matrix Y = A{y) has 
rank two, and rank one or four optimal solutions do not exist. These former two cases can be 
understood geometrically by drawing a picture as in Figure 3 and Example [2] The surface 
det(F) — has degree four and it has 10 isolated singular points (these are the matrices 

Y of rank two). The surface dual to the quartic det(F) = has degree 16. Our optimal 
solution in Example [4] is one of the 16 intersection points of this dual surface with the line 
specified by the linear objective function B uY = b ■ . The concept of duality in algebraic 
geometry will be reviewed in Section 5. 

For larger semidefinite programming problems it is impossible (and hardly desirable) to 
solve the polynomial equations (|2.6p ~ (|2.7p symbolically. However, we know that the coordi- 
nates iji and Xjk of the optimal solution are the roots of some univariate polynomials which 
are irreducible over Q. If the data are generic, then the degree of these univariate polynomi- 
als depends only on the rank of the optimal solution. This is what we call the algebraic degree 
of the semidefinite programming problem (|2.ip - (|2.3p and its dual (|2.4p - (|2.5p . The objective 
of this paper is to find a formula for this algebraic degree. 



3. Prom Pataki's inequalities to algebraic geometry 

Example m raises the question which ranks are to be expected for the optimal matrices. This 
question is answered by the following result from the SDP literature [ll,fT8l . We refer to the 
semidefinite program specified in ((2T|) - ((Z!3)) . (|2^ -(|23 | and (|2?6)) - ((^ . Furthermore, we 
always assume that the problem instance (C, b, Ai, . . . , Am) is generic, in the sense discussed 
above. 

Proposition 5 (Pataki's Inequalities ITS', Corollary 3.3.4], see also [T]) 

Let r and n — r be the ranks of the optimal matrices Y = A{y) and X . Then 

-r')^™ cr)^("r)-- <"> 

A proof will be furnished later in this section. First we illustrate Proposition [5] by de- 
scribing a numerical experiment which is easily performed for a range of values (m, n). We 
generated m-tuples of n x n- matrices {Ai, A2, . . . , Am), where each entry was independently 
drawn from the standard Gaussian distribution. Then, for a random positive semidefinite 
matrix Xq, let bi — Ai • Xq, which makes the feasible set (|2.2p - (|2.3|) is nonempty. For each 
such choice, we generated 10000 random symmetric matrices C. Using SeDuMi [2T| we then 
solved the program (|2.1|) - ()2.3p and we determined the numerical rank of its optimal solution 
Y . This was done by computing the Singular Value Decomposition of Y in Matlab. The 
result is the rank distribution in Table [TJ 

Table [T] verifies that, with probability one, the rank r of the optimal matrix Y lies in 
the interval specified by Pataki's Inequalities. The case m = 2, which concerns Vinnikov 
curves as in Example [TJ is not listed in Table [T] because here the optimal rank is always 
n — 1. The first interesting case is m = n = 3, which concerns Cayley's cubic surface as 
in Example [21 When optimizing a linear function over the convex surface in Figure [31 it is 
three times less likely for a smooth point to be optimal than one of the four vertices. For 
m = 3,n = 4, as in Example [H the odds are sHghtly more balanced. In only 35.34% of 
the instances the algebraic degree of the optimal solution was 16, while in 64.66% of the 
instances the algebraic degree was found to be 10. 
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n 


3 


4 


5 


6 


m 


rank percent 


rank percent 


rank percent 


rank percent 


3 


2 24.00% 
1 76.00% 


3 35.34% 
2 64.66% 


4 79.18% 
3 20.82% 


5 82.78% 
4 17.22% 


4 


1 100.00% 


3 23.22% 
2 76.78% 


4 16.96% 
3 83.04% 


5 37.42% 
4 62.58% 


5 


1 100.00% 


2 100.00 % 


4 5.90% 
3 94.10% 


5 38.42% 
4 61.58% 


6 




2 67.24% 
1 32.76% 


3 93.50% 
2 6.50% 


5 1.32% 
4 93.36% 
3 5.32% 


7 




2 52.94% 
1 47.06% 


3 82.64% 
2 17.36% 


4 78.82% 
3 21.18% 


8 




1 100.00% 


3 34.64% 
2 65.36% 


4 45.62% 
3 54.38% 


9 




1 100.00% 


3 7.60% 
2 92.40% 


4 23.50% 
3 76.50% 



Table 1. Distribution of the rank of the optimal matrix Y. 



This experiment highlights again the genericity hypothesis made throughout this paper. 
We shall always assume that the m-tuple {Ai, . . . , A.jn), the cost matrix C and vector b are 
generic. All results in this paper are only valid under this genericity hypothesis. Naturally, 
special phenomena will happen for special problem instances. For instance, the rank of the 
optimal matrix can be outside the Pataki interval. While such special instances are important 
for applications of SDP, we shall not address them in this present study. 

In what follows we introduce our algebraic setting. We fix the assumptions 

&i = 1 and 62 = &3 = • ■ • = Ki = 0. (3.2) 

These assumptions appear to violate our genericity hypothesis. However, this is not the case 
since any generic instance can be transformed, by a linear change of coordinates, into an 
instance of ([MD-dUD that satisfies (gSl). 

Our approach is based on two linear spaces of symmetric matrices, 

U = {C,Ai,A2,...,A„,) C 5", 
V = {A2,...,A„,) c U. 

Thus, U is a generic linear subspace of dimension m + 1 in 5", and V is a generic subspace 
of codimension 2 in U. This specifies a dual pair of flags: 

VcUc 5" and c C (5")*. (3.3) 

In the definition of U and V, note that C is included to define U and Ai is excluded to 
define V, this is because we want to discuss the problem in the projective spaces PiS" and 
FU whose elements are invariant under scaling. So we ignored the constraint Ai • X ~ 1. 
With these flags the optimality condition (|2.6p - (|2.7p can be rewritten as 

X eV^ and Y eU and X • F = 0. (3.4) 

Our objective is to count the solutions of this system of polynomial equations. Notice that if 
{X, Y) is a solution and A, /i are non-zero scalars then the pair {XX, ^Y) is also a solution. 
What we are counting are equivalence classes of solutions (13. 4|) where {X, Y) and {XX, ^Y) 
are regarded as the same point. 
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Indeed, from now on, for the rest of this paper, we pass to the usual setting of algebraic 
geometry: we complexify each of our linear spaces and we consider 

PV c PW c P5" and c PV-^ C P(5")*. (3.5) 

Each of these six spaces is a complex projective space. Note that 

dim(PW) = m and dim(PV^) = 2 ^) " (3-6) 

If W is any of the six linear spaces in ()3.3p then we write Dy^ for the determinantal variety 
of all matrices of rank < r in the corresponding projective space PW in (|3.5p . Assuming 
that D'!^ is non-empty, the codimension of this variety is independent of the choice of the 
ambient space PW. By TO], we have 



codim(i^^v) ^ [ 2 )■ ^^-^^ 

We write {XY = 0} for the subvariety of the product of projective spaces P(iS")* x PiS" 
which consists of all pairs {X, Y) of symmetric n x n-matrices whose matrix product is zero. 
If we fix the ranks of the matrices X and Y to be at most n — r and r respectively, then we 
obtain a subvariety 

{XY^Or {XY = 0} n {D^-:^,xD^s-). (3.8) 

Lemma 6 The subvariety {XY — 0}'" is irreducible. 

Proof. Our argument is borrowed from Kempf 14J. For a fixed pair of dual bases in C" and 
(C")*, the symmetric matrices X and Y define linear maps 

X : C" ^ (C")* and Y : (C")* ^ C". 

Symmetry implies that ker{X) — lm{X)^ and ker(y) = Im(y)^. Therefore, XY = 
holds if and only if ker(X) D Im(y), i.e. ker(y) 3 ker(X)-'-. For fixed rank r and a 
fixed subspace K C C", the set of pairs {X,Y) such that kerX I) K and ker(y) 3 K-^ 
forms a product of projective linear subspaces of dimension ("^2^^) — 1 and C^^^) — 1 
respectively. Over the Grassmannian G{r,n) of dimension r subspaces in C", these triples 
{X,Y,K) G P(5")* X P5" X G{r,n) form a fiber bundle that is an irreducible variety. The 
variety {XY ~ Oy is the image under the projection of that variety onto the first two 
factors. It is therefore irreducible. □ 

Thus, purely set-theoretically, our variety has the irreducible decomposition 

n-l 

{XY = 0} = [j{XY = OY. (3.9) 

Solving the polynomial equations (|3.4|) means intersecting these subvarieties of P(iS" )* xPiS" 
with the product of subspaces PV^ x PU. In other words, what is specified by the optimality 
conditions (j2.6p - (|2.7p is a point in the variety 

{XY = OY n (PV-L xFU) , . 

= {XY = 0} n (d:^-^'- X Djj) ^ ' 

which satisfies p.Sp . Here r is the rank of the optimal matrix A{y). 
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In the special case of linear programming, the matrices X and Y in (|3.10p are diagonal, 
and hence {XY = 0}'" is a finite union of linear spaces. Therefore the optimal pair {X,Y) 
satisfies some system of linear equations, in accordance with the fact that the algebraic 
degree of linear programming is equal to one. 

The following is our main result in this section. 

Theorem 7. For generic 11 and V, the variety IS. 1 0|) is empty unless Pataki's inequalities 
113.1]) hold. In that case the variety kS.lOfi is reduced, non-empty, zero- dimensional, and at 
each point the rank of X and Y is n — r and r respectively. The cardinality of this variety 
depends only on m,n and r. 

The rank condition in Theorem [7] is equivalent to the strict complementarity condition 
stated after Theorem [3l Theorem [7] therefore implies: 

Corollary 8 The strict complementarity condition holds generically. 

We can now also give an easy proof of Pataki's inequalities: 

Proof of Proposition^^ The inequalities p.ip are implied by the first sentence of Theorem [7l 
since the variety (|3.10p is not empty. □ 

Proof of Theorem^ Using (|3.6p and p.7p . we can rewrite (|3.ip as follows: 

codim(i:>^„) < dim(PiY) and codim(i:>^\;^:^. ) < dim(PV-^). 

Pataki's inequalities are obviously necessary for the intersection p.lOp to be non-empty. 
Suppose now that these inequalities are satisfied. 

We claim that the dimension of the variety {XY = Oy equals ("^^) — 2. In particular, 
this dimension is independent of r. To verify this claim, we first note that there are ("^^) — 
1 — ("~2^^) degrees of freedom in choosing the matrix Y S Dg^. For any fixed matrix Y of 
rank r, consider the linear system X ■ Y = of equations for the entries of X. Replacing 
y by a diagonal matrix of rank r, we see that the solution space of this linear system has 
dimension (" 2^^) — 1. The sum of these dimensions equals ("J"'^) — 2 as required. Hence 

dim({Xr = O}'") + dim(PV^ x PU) = dim(P(5")* x P5"). 

By Bertini's theorem [9l Theorem 17.16] for generic choices of the linear spaces U and V, the 
intersection of {XY = 0}'" with PV^ x PU is transversal, i.e., in this case finite, each point 
of intersection is smooth on both varieties, and the tangent spaces intersect transversally. 
Thus the rank conditions are satisfied at any intersection point. Furthermore, when the 
intersection is transversal, the number of intersection points is independent of the choice of 
U and V. 

We therefore exhibit a non-empty transversal intersection. Fix any smooth point {Xq, Iq) G 
D^gjy X with Xq -Yq = 0. This means that Xq has rank n — r and Yq has rank r. After 
a change of bases, we may assume that Xq is a diagonal matrix with r zeros and n~r ones 
while Yq has r ones and n — r zeros. 

For a generic choice of a subspace containing Xq and a subspace U containing Yq, 
the intersection {XY = Oy n (PV^ x PU) is transversal away from the point {Xq, Yq). We 
must show that the intersection is transversal also at {Xq, Yq). We describe the affine tangent 
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space to {XY ^ Oy at {Xq, Yq) in an affine neighborhood of {Xq, Yq) e P(5" 
affine neighborhood is the direct sum of the affine spaces parameterized by 



P5". The 



X 



1,1 



X, 



l,r ' ' ' Xj^ f yv^ 
Xl^r+1 ••• Xr,r+1 1 + -'^r+l.r+l 



Xr- 



Xi n 



Xr^n 
Xr+l,n 



Xl^n-1 ■ ■ ■ 
Xl.n ■ ■ ■ 



1 ^1,2 

l + i"2,2 

Yl,r 
^l,r+l 

^1,71 



1 + ^n-l,n-l ^ri-l,n 
Xn-l,n 1 



Y2,n 



and 



(3.11) 



1 "I"" ^^\7" 



r,r+l 



Y 



Y 

^ n.n 



(3.12) 



In the coordinates of the matrix equation XY = 0, the hnear terms are 



Xij for i < j < 



Yi_j for r 



1 ^ * ^ .? and Xij + J for i < r < j < n 



These hnear forms are independent, and their number equals the codimension of {XY = Oy 
in P(<S")* X PvS". Hence their vanishing defines the affine tangent space at the smooth point 
{XqjYq). For a generic choice these hnear terms are independent also of V and U-^, which 
assures the transversality at (Xo,lb). □ 
From Theorem[7]we know that the cardinality of the finite variety (|3.10|) is independent of 
the choice of generic U and V, and it is positive if and only if Pataki's inequalities p. II) hold. 
We denote this cardinality by 5(m, n,r). Our discussion shows that the number 6(171, n,r) 
coincides with the algebraic degree of SDP, which was defined (at the end of Section 2) as 
the highest degree of the minimal polynomials of the optimal solution coordinates iji and 



4. A table and some formulas 

The general problem of semidefinite programming can be formulated in the following primal- 
dual form which was derived in Section 3. An instance of SDP is specified by a flag of linear 
subspaces V C C 5" where dim(V) = m — 1 and dim(W) = m + 1, and the goal is to find 
matrices X,Y ^ 5" such that 

X eV^ and YeU and X-Y^O and X,YhO- (4.1) 

Ignoring the inequality constraints A, 1" ^ and fixing the rank of A to be n — r, the 
task amounts to computing the finite variety ()3.10p . The algebraic degree of SDP, denoted 
S{m,n,r), is the cardinality of this projective variety over C. A first result about algebraic 
degree is the following duality relation. 

Proposition 9 The algebraic degree of SDP satisfies the duality relation 

5{m,n,r^ = (5(("J"'^) — m, n, n — r) . (4-2) 
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n = 2 


n = 3 


n = 4 


n = 5 


n = 6 


m 


r degree 


r degree 


r degree 


r degree 


r degree 


1 


1 2 


2 3 


3 4 


4 5 


5 6 


2 


1 2 


2 6 


3 12 


4 20 


5 30 


3 




2 4 
1 4 


3 16 
2 10 


4 40 
3 20 


5 80 
4 35 


4 




1 6 


3 8 
2 30 


4 40 
3 90 


5 120 
4 210 


5 




1 3 


2 42 


4 16 
3 207 


5 96 
4 672 


6 






2 30 
1 8 


3 290 
2 35 


5 32 
4 1400 
3 112 


7 






2 10 
1 16 


3 260 
2 140 


4 2040 
3 672 


8 






1 12 


3 140 
2 260 


4 2100 
3 1992 


9 






1 4 


3 35 
2 290 


4 1470 
3 3812 



Table 2. The algebraic degree &{m,n,r) of semidefinite programming 



Proof. For generic C,Ai,b, by Corollary [H the strict complementarity condition holds. So 
in jMI-llMl), if rank(X*) = r, then rank(A(y*)) = n - r. But the dual problem ([2:41) - ([23]) 
can be written equivalently as some particular primal SDP (|2.ip - (|2.3p with to' = ("J^) — to 
constraints. The roles of X and A{y) are reversed. Therefore the duality relation stated in 
(g^) holds. □ 

A census of all values of the algebraic degree of semidefinite programming for m < 9 and 
n < 6 is given in Table [2l Later, we shall propose a formula for arbitrary to and n. First, let 
us explain how Table [2] can be constructed. 

Consider the polynomial ring y] in the n(n + 1) unknowns Xij and yij, and let 
(XY) be the ideal generated by the entries of the matrix product XY. The quotient R = 
Q[X, Y\l (XY) is the homogeneous coordinate ring of the variety [XY — 0}. For fixed rank r 
we also consider the prime ideal {XY)^"^^ and the coordinate ring i?^'''' — Y\l {XY)^"^^ 
of the irreducible component {XY = 0}'' in ()3.9p . The rings R and R^"^^ are naturally graded 
by the group 1? . The degrees of the generators are deg(a::ij) = (1,0) and deg(j/y) = (0, 1). 

A convenient tool for computing and understanding the columns of Table [2] is the notion 
of the multidegree in combinatorial commutative algebra [17]. Recall from [17', Section 8.5] 
that the multidegree of a Z'^-graded afhne algebra is a homogeneous polynomial in d un- 
knowns. Its total degree is the height of the multihomogeneous presentation ideal. If d = 2 
then we use the term bidegree for the multidegree. Let C{R;s,t) and C{R^^^; s,t) be the 
bidegree of the Z^-graded rings R and R^^^ respectively. Since the decomposition p.Op is 
equidimensional, additivity of the multidegree |17[ Theorem 8.53] implies 

n 

C{R-s,t) = ^C(i?W;s,i). 

The following result establishes the connection to semidefinite programming: 

Theorem 10. The bidegree of the variety {XY = 0}'' is the generating function for the 
algebraic degree of semidefinite programming. More precisely, 

C{R^'-^;s,t) = ^ (5(TO,n,r)-s("^')-™-t". (4.3) 

m— 
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Proof. Fix the ideal / ~ {XY}^'^\ and let in(/) be its initial monomial ideal with respect 
to some term order. The bideeree remains unchanged if we replace / by in(/) 

in the definition of R^^\ The same holds for the right hand side of (|4.3|) because we can 
also define S{m,n,r) by using the initial equations in(/) in the place of {XY — 0}^"^^ 
in (|3.10p . By additivity of the multidegree, it suffices to consider minimal prime ideals of 
in(/). These are generated by subsets of the unknowns Xij and ijij. If such a prime contains 
("2^^) ~ ^ unknowns Xij and m unknowns yij then its variety intersects PV^ x ¥U only if 
dim(PW) = m. In this case the intersection consists of one point. Hence the contribution to 
the bidegree equals s( ^ . ^s claimed. □ 



This formula (|4.3p is useful for practical computations because, in light of the degen- 
eration property |171 Corollary 8.47], the bidegree can be read off from any Grobner basis 
for the ideal {XY) . Such a Grobner basis can be computed for small values of n but no 
general combinatorial construction of a Grobner basis is known. Note that if we set s = < = 1 
in C{R^^^; s, t) then we recover the ordinary Z-graded degree of the ideal {XY)'^^^ . 

Example 1. We examine our varieties {XY = 0} for n = 4 in Macaulay 2: 

R = qq[xll,xl2,xl3,xl4,x22,x23,x24,x33,x34,x44, 

yll,yl2,yl3,yl4,y22,y23,y24,y33,y34,y44] ; 

X = matrix {{xll, xl2, xl3, xl4}, 

{xl2, x22, x23, x24}, 

{xl3, x23, x33, x34}, 

{xl4, x24, x34, x44»; 

Y = matrix {{yll, yl2, yl3, yl4}, 

{yl2, y22, y23, y24}, 

{yl3, y23, y33, y34}, 

{yl4, y24, y34, y44»; 

minors(l,X*Y) + minors(2,X) + minors(4,Y) 

minors(l,X*Y) + minors(3,X) + minors(3,Y) 

minors(l,X*Y) + minors(4,X) + minors(2,Y) 



codim 00, degree 00 
codim 00, degree 00 



codim 00, degree 00 

This Macaulay 2 code verifies that three of the five irreducible components {XY = 0}^^^ 
in p.gp all have codimcnsion 10, and it computes their Z-graded degree: 

4 

C(i?^3>; 1,1) = ^(5(to,4,3) = 4+12+16-F8 = 40, 

m— 1 
7 

C(i?^2>; 1,1) = ^(5(to,4,2) = 10 -f 30 + 42 -I- 30 -I- 10 = 122. 

m— 3 

The summands are the entries in the n = 4 column in Table [21 We compute them by 
modifying the degree command so that it outputs the bidegree. □ 

We now come to our main result, which is a collection of explicit formulas for many of 
the entries in Table [21 organized by rows rather than columns. 

Theorem 11. The algebraic degree of semidefinite programming, S{m,n,r), is determined 
by the following formulas for various special values of m, n, r: 

1. If the optimal rank r equals n — 1 then we have 



d{m,n,n-l) = 2" 



m 
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2. If the optimal rank r equals n — 2 and 3 < m < 5 then we have 

S{3,n,n-2) = (^"^ + ^ 

J(4,n,n-2) = 6 + ^ 

3. If the optimal rank r equals n — 3 and 6 < m < 9 then we have 

fn + 2\ fn + 2 

Si7,n,n-3) = 28 (" ; ^) - 12 (" ^ ^ 

^(8,n,n-3) = 248(" J ^) - 32o(^^ ; ^) + 84(" ; ^ 

Si9,n,n-3) = 1794 ("J^) - 3778 ("J^) + 2436 (";^) - 448 ("^^ 

Theorem [11] combined with Proposition [9] explains all data in Table O except for four 
special values which were first found using computer algebra: 

S{6, 6, 4) = 1400, (5(7, 6, 4) = 2040, 6(8, 6, 4) = 2100, 6(9, 6, 4) 1470. (4.4) 

An independent verification of these numbers will be presented in Example 1201 This will 
illustrate our general formula which is conjectured to hold arbitrary values of m, n and r. 
That formula is stated in Theorem \W\ and Conjecture [^H 

We note that an explicit and completely general formula for the algebraic degree 5(m, n, r) 
was recently found by von Bothmer and Ranestad [5S]. 



5. Determinantal varieties and their projective duals 

In Theorem [ini the algebraic degree of SDP was expressed in terms of the irreducible com- 
ponents defined by the symmetric matrix equation XY = 0. In this section we relate this 
equation to the notion of projective duality, and we interpret J(m, n, r) as the degree of the 
hypersurface dual to the variety DJj. 

Every (complex) projective space P has an associated dual projective space P* whose 
points w correspond to hyperplanes {w = 0} in P, and vice versa. Given any (irreducible) 
variety V c¥, one defines the conormal variety CV of V to be the (Zariski) closure in P* x P 
of the set of pairs (w, v) where {w = 0} C P is a hyperplane tangent to 1/ at a smooth point 
V gV. The projection of CV in P* is the dual variety V* to V. The fundamental Biduality 
Theorem states that CV is also the conormal variety of V* , and therefore V is the dual 
variety to V* . For proofs and details see [H §1.1.3] and [22l §1.3]. 

In our SDP setting, we write f{S'^)* for the projective space dual to PiS". The conormal 
variety of the determinantal variety is well understood: 

Proposition 12 [71 Proposition 1.4.11] The irreducible variety {XY — QY in P(iS")* x 
PiS" coincides with the conormal variety of the determinantal variety and likewise of 

^^S")* ■ particular D?Z^^, is the dual variety to Dg^- 
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Proof. Consider a symmetric n x n-matrices Y of rank r. We may assume that Y is diagonal, 
with the first r entries in the diagonal equal to 1, and the remaining equal to 0. In the affine 
neighborhood of Y, given by Yi i ^ 0, the matrices have the form (|3.12p . The determinantal 
subvariety intersects this neighborhood in the locus where the size r + 1 minors vanish. 
The linear parts of these minors specify the matrices X that define the hyperplanes tangent 
to at Y. The only such minors with a linear part are those that contain the upper left 
r X r submatrix. Furthermore, their linear part is generated by the entries of the lower right 
(n — r) X (n — r) matrix. But the matrices whose only nonzero entries are in this lower right 
submatrix are precisely those that satisfy our matrix equation X ■ Y — 0. □ 

Theorem 13. LetlA he a generic (m + 1)- dimensional linear subspace of S^^ , and consider 
the determinantal variety DJj of symmetric matrices of rank at most r in ¥U. Then its dual 
variety {D^)* is a hypersurface if and only if Pataki's inequalities \3.1\) hold, and, in this 
case, the algebraic degree of semidefinite programming coincides with the degree of the dual 
hypersurface: 



Proof. Recall the codimension 2 inclusions V C W in S" and C in S"*. The 
space of linear forms U* is naturally identified with the quotient space S'^* /U^, and hence 
f{S'^*/U^) = ¥U*. The image of the induced rational map PV^ ¥U* is the projective 
line := P(V^/Z//^). The points on the line correspond to the hyperplanes in ¥U that 



Note that the rational map tt is only defined outside PU x PU. 

We already know that 6{m, n, r) is the cardinality of the finite variety 

Z {XY = Oy n (PV^ X PU) {XY = 0} n {D'^-'' X Dli) . 

Since U-^ is generic inside V^, none of the points of Z lies in PU^ x PU, so we can apply tt 
to Z, and the image 7r(Z) is a finite subset of xPU C PU* x PU. 

We next prove that Z and it{Z) have the same cardinality. By Proposition [T^ a point on 
{XY = QY is a pair {X, Y) where the hyperplane {X — 0} is tangent to the determinantal 
variety -DJ„ at the point Y. Thus, if (Xo, Fq) and {Xx,Yo) are distinct points in Z that have 
the same image under tt, then both {X^ = 0} and {Xi = 0} contain the tangent space to 
at Yq. The same holds for {sXo + tXi — 0} for any s,i: £ C. Hence Z contains the 
entire line {{sXq +tXi, Yq) : s,t ^ C}, which is a contradiction to Z being finite. Therefore 
TT restricted to Z is one to one, and we conclude #7r(Z) = 6(m, n, r). 

A point {X, Y) in Z represents a hyperplane {X — 0} in PiS" that contains the subspace 
PV and is tangent to D^,^ at a point Y in PU, i.e. it is tangent to the determinantal variety 
DJj at this point. Consider the map {X = 0} i-^ {X = 0} n PU which takes hyperplanes 
in PiS" to hyperplanes in FU. This map is precisely the rational map PV^ FU* defined 
above. The image of {X, Y) & Z under tt thus represents a hyperplane {X = 0} flPW that 
contains the codimension 2 subspace PV and is tangent to DJ^ at a point Y . The hyperplanes 
in FU that contains PV form the projective line P^ := P{V^ /U^) in PU* , so tt{Z) is simply 
the set of all hyperplanes in that P^ which are tangent to D^. Equivalently, 7r(Z) is the 
intersection of the dual variety {DJ^)* with a general line in PU* . Hence tt{Z) is nonempty if 
and only if (D^)* is a hypersurface, in which case its cardinality 5{m,n,r) is the degree of 
that hypersurface. The first sentence in Theorem [7] says that Z ^ if and only if Pataki's 
inequalities hold. □ 



5{m, 71, r) 



deg{DJj)*. 




F^ xPU C FU* X PU. 



FU* induces a map in the first 
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The examples in the Introduction serve as an illustration for Theorem [131 In Example [TJ 
the variety is a cubic Vinnikov curve and its dual curve (Dl^)* has degree 5{2, 3, 1) = 6. 
Geometrically, a general line meets the (singular) curve in Figure 2 in six points, at least 
two of which are complex. A pencil of parallel lines in Figure 1 contains six that are tangent 
to the cubic. 

In ExamplelU the variety D^ is the Cayley cubic in Figure 3, and the variety Dj^ consists 
of its four singular points. The dual surface {D^)* is a Steiner surface of degree 4, and the 
dual surface (Dl^)* consists of four planes in VU* ~ P"^. Thinking of as a limit of smooth 
cubic surfaces, we see that each of the planes in {D^)* should be counted with multiplicity 
two, as the degree of the surface dual to a smooth cubic surface is 12 = 4 + 2-4. 

In general, the degree of the hypersurface (DJj)* depends crucially on the topology and 
singularities of the primal variety D^. In what follows, we examine these features of deter- 
minantal varieties, starting with the case U = . 

Proposition 14 The determinantal variety of symmetric matrices of rank at most r 

has codimension (""2^"'^) and is singular precisely along the subvariety -D^^^ of matrices of 
rank at most r — 1 . The degree of D^^ equals 



n-r-1 ( n+j \ 

\n—r—jJ 



n 

J=0 



Proof. The codimension, and the facts that Dg„ is singular along D^,J , appears in [9l 
Example 14.16]. The formula for the degree is [TD] Proposition 12]. □ 

When Djj is finite, then the above formula suffices to determine our degree. 

Corollary 15 The algebraic degree of semidefinite programming satisfies 



n-r— 1 ^ n+j j 

(5(m,n,r) = Y[ pV+iN provided m = (""2^^). 



Proof. If W C 5" is a generic subspace of dimension ("'"2^'^) + 1, then and _DJ„ have 
complementary dimension in PiS", so — FlAClDgn is finite and reduced, with cardinality 
equal to the degree of Dg„. The dual variety of a finite set in PU is a finite union of 
hyperplanes in PU*, one for each point. □ 

For m = n = 3 and r = I this formula gives us 5(3,3, 1) = 4. This is the number of 
singular points on the Cayley cubic surface in Example [2l 

More generally, whenever Z?^ is smooth (i.e. when -D^^^ — 0), the degree of the dual 
hypersurface may be computed by the following Class Formula. For any smooth variety X C 
P™ we write xi^) for its Euler number, which is its topological Euler-Poincare characteristic. 
Likewise, we write xi^ ^ H) and H iJ n H') for the Euler number of the intersection 
of X with one or two general hyperplanes H and H' in P™ respectively. 

Proposition 16 (Class Formula, 22, Theorem 10.1]) If X is any smooth subvariety ofP™ 
whose projective dual variety X* is a hypersurface, then 



degx* = (-i)<^W-(x(x) - 2.x(^ni/) + x(^niJniJ')) 
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This formula is best explained in the case when X is a curve, so that X OH is a finite set 
of points. The Euler number of X O H is its cardinality, i.e. the degree of X. Furthermore 
X n if n -ff' = 0, so the Class Formula reduces to 

degX* = -x{X) + 2-degX. (5.1) 

To see this, we compute the Euler number using a general pencil of hyperplanes, 

namely those containing the codimension 2 subspace HOH' C P™. Precisely d = degX* of 
the hyperplanes in this pencil are tangent to X, and each of these hyperplanes will be tangent 
at one point and intersect X in degX — 2 further points. So the Euler number is degX — 1 
for each of these hyperplane sections. The other hyperplane sections all have Euler number 
deg X and are parameterized by the complement of d points in a . By the multiplicative 
property of the Euler number, the union of the smooth hyperplane sections has Euler number 
(x(P^) — d) ■ (degX). By additivity of the Euler number on a disjoint union, we get the Class 
Formula dSH) for curves: x(^) = (xi^^) - d) ■ {degX) + d- {degX - 1) = 2-degX-d. 

The number xi^u) depends only on m, n and r, when U is general, so we set xC*^: '^j ^) •= 
x{Dli). When H and H' are general, the varieties D H and Ci H D H' are again gen- 
eral determinantal varieties, consisting of matrices of rank < r in a codimension 1 (resp. 
codimension 2) subspace of Z//. The Class Formula therefore implies the following result. 

Corollary 17 Suppose that (""2^^) < m < (""2^^). Then we have 

S{m,n,r) = (— ^ ) . ^ xC'tj, n, r) — 2 • x(™ ~ 1, ^t-, ^) + xC'ti — 2, n, r)) . 

Proof. The determinantal variety 1?^^ — ^ generic U C S"^ is nonempty if and 

only if m > codimZJ^y = (""j"*"^)' ^"^^ smooth as soon as is empty, i.e. when 

m < (" 2^^) . Therefore the Class Formula applies and gives the expression for the degree 
of the dual hypersurface (Z?^)*. □ 

The duality in Proposition [9] states (5(m,n,r) — ^(("J^) — m,n,n — r). So, as long as 
one of the two satisfies the inequalities of the Corollary [T71 the Class Formula computes the 
dual degree. In Tabled this covers all cases except (5(6,6,4), and it covers all the cases of 
Theorem 111! So, for the proof of Theorem I Hi it remains to compute the Euler number for 
a smooth DJj. We close with the remark that the Class Formula fails when DJj is singular. 



6. Proofs and a conjecture 

The proof of Theorem 1 1 1 1 has been reduced, by CoroUarv llTl to computing the Euler number 
for a smooth degeneracy locus of symmetric matrices. We begin by explaining the idea of 
the proof in the case of the first formula. 

Proof of Theorem \ll\ ( 1) Since 5(m, n, n — l) = — m, n, l), by Proposition[9l we may 

consider the variety of symmetric rank 1 matrices, which is smooth and coincides with the 
second Veronese embedding of P"^^. The determinantal variety Z?^ is thus a linear section 
of this Veronese embedding of P"^^. Equivalently, is the Veronese image of a complete 
intersection of m — 1 quadrics in P"^^. Our goal is to compute the Euler number of D^. 

By the Gauss-Bonnet formula, the Euler number of a smooth variety is the degree of 
the top Chern class of its tangent bundle. Let h be the class of a hyperplane in P"^^. The 
tangent bundle of I?^ is the quotient of the tangent bundle of P"~i restricted to and 
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the normal bundle of 0^ in P" ^ . The total Chern class of our deterniinantal manifold 
is therefore the quotient 

and the top Chern class Cn-m{D^) is the degree n ~ m term in c{Dl^). Similarly the top 
Chern class oi Djj n H and n H n H' is the de ffree n — m terms of 

■2h and . , , , • ih'^. 



(1 + 2/1)" (l + 2/i)™+i 

where the last factor indicates that we evaluate these classes on Z?^ and use the fact that 
H = 2h. By Proposition [ini the dual degree deg(D^)* is obtained by evaluating (— 1)"^™ 
times the degree n — m term in the expression 

(1 + ;^)" , (1 + hr ^ (1 + fe)" ^^^2 + 



(l + 2/i)™-i (l + 2/i)™ (1 + 2/1)"+! (l + 2/i)"+i' 

That term equals (^) • /i""™. Since the degree of our determinantal variety equals degD^ = 
Jjji /i""™ = 2"-i, we conclude deg(i:>i)* = (J^) • 2"-™. □ 

In the general case we rely on a formula of Piotr Pragacz [19] . He uses Schur Q-functions 
to define an intersection number on P™ with support on the degeneracy locus of a symmetric 
morphism of vector bundles. Our symmetric determinantal varieties are special cases of this. 
Pragacz' intersection number does not depend on the smoothness of the degeneracy locus, 
but only in the smooth case he shows that the intersection number equals the Euler number. 
By Corollary 1 171 we then obtain a formula for (5(m, n, r) in the smooth range. 

To present Pragacz' formula, we first need to fix our notation for partitions. A partition 
A is a finite weakly decreasing sequence of nonnegative integers Ai > A2 > ■ • • > Afc > 0. In 
contrast to the usual conventions [16j , we distinguish between partitions that differ only by a 
string of zeros in the end. For instance, we consider (2, 1) and (2, 1, 0) to be distinct partitions. 
The length of a partition A is the number k of its parts, while the weight \X\ = Ai + . . . + A/c 
is the sum of its parts. The sum of two partitions is the partition obtained by adding the 
corresponding A,;. A partition is strict if Ai_i > Ai for all i = 1, . . . , fc — 1. The special 
partitions (A;, A; — 1, fc — 2, . . . , 1) and (fc, fc — 1, fc — 2, . . . , 1, 0) are denoted by p{k) and pvi{k) 
respectively. 

Now, let £^ be a rank n vector bundle on P™ with Chern roots xi,...,a;„, and let 
(j) : E* — > i? be a symmetric morphism. Consider the degeneracy locus D^{(j)) of points in 
P™ where the rank of is at most r. For any strict partition A let Q\{E) be the Schur 
Q-function in the Chern roots (see [Sl ITFlfT^ for definitions). Thus Q\{E) is a symmetric 
polynomial in a;i, . . . , a;„ of degree equal to the weight of A. Pragacz defines the intersection 
number 

e{D^m^ f ^(-1)1^1 •((A + po(n-r-l)))-Q(,+,(„_,))(i?).c(P"), (6.1) 

where the sum is over all partitions A of length n — r and weight |A| < m — (" 2^^)- Let us 
carefully describe the ingredients of the formula. The factor c(P™) is the total Chern class 
(1 + /i)''"+i of P™. For any strict partition A := (Ai > A2 > • • • > Afc > 0), the factor ((A)) is 
an integer which is defined as follows. It depends on k whether or not Afc = 0. If the number 
fc is 1 or 2 then we set ((?)) := 2' and 
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In general, when k is even, we set 

((Ai,...,Afc)) Pfafr(((A„AO)),<,. 

Here "PfafF denotes the Pfajjian (i.e. the square root of the determinant) of a skew- 
symmetric matrix of even size. Finally, when k is odd, we set 

k 

((Ai,...,A,)) := ^(-iri.2^-((Ai,...,A;,...,A,)). 

j=i 

Proposition 18 (Pragacz, [19, Prop. 7.13]) If the degeneracy locus D'^{(j)) is smooth and 
of maximal codimension ("^!^^"'^) then x(Z?''((/))) = e(Z?''((/))). 

In our situation, the morphism (j) arises from the space U and is given by a symmetric 
matrix whose entries are linear forms on P™. We thus apply the trick, used in |lOj and also 
in [6l Section 6.4], of formally writing 

E = Op„.(|) © ■■■ ® Op™(|) = nOp„.(^). 

Here is a map nOpm(—!^) nOpm{^) and the determinantal variety D'lf is the locus of 
points where this map has rank at most r. The n Chern roots of the split vector bundle E are 
all equal to h/2. Thus, in applying Pragacz' formula (j6.ip . we restrict the Schur Q-functions 
to the diagonal xi = . . . — Xn — h/2. 

The result of this specialization of the Schur Q-function is an expression 

Q^{E) = b^(n)-h\^K (6.2) 

where b\{n) is a polynomial in n with b{Z) C Z[l/2]. We multiply (|6.2p with the comple- 
mentary Chern class of P™, which is the expression 

C|A|(P ) - _ _ |;,| j 

Pragacz' intersection number (j6.ip now evaluates to 

^(_1)|A| . ((A + p,(n - r - 1))) • 6(A+p(„-.))(n) • (^^ _ ^1",+?^ _ |^|) . (6.3) 

We abbreviate the expression (|6.3p by e{m,n^r). Note that e{m,n,r) is a polynomial in n 
for fixed m and r. In the smooth range we can now apply Corollary [T7] to obtain a formula 
for the degree of the dual variety (DJj)*: 

S{m, n, r) = (— 1)'™^( 2 )) . (e(m, n, r) — 2 • e(m — 1, n, r) -f e(m — 2, n, r)) . 
This yields a formula for the algebraic degree of semidefinite programming: 
Theorem 19. // (""3^^) < m < (""3^^) t/ien 

5(m, n, r) = {-if ■ ^(-1)1^1 • ((A + p,{n - r - 1))) • 6(A+,(„_.))(n) • {j'^l^ , 

where d = m — (" 2^"'^) is the dimension of the variety DJj and the sum is over all 
partitions A of length n — r and weight |A| < m — (""j^^) • 
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Proof. It remains only to apply Corollary II 71 Comparing the formulas for e(m, n, r), e(m — 
1, n, r) and e(m — 2, n, r), the difference is in the number of summands and in the binomial 
coefficient in the last factor. But the relation 

771 + 1\ / m\ /to— 1\ /to— 1 
k ) ^ \k - l) ^ \ k - 2) ^ k 

holds whenever k > 1, while {"'+^) - 2(';;') = {"'-^) and ('"+^) = {"'^^). So the formula for 
S{m, 71, r) reduces to the one claimed in the theorem. □ 

The formula for S{m,n,r) in Theorem 1191 is explicit but quite impractical. To make it 
more useful, we present a rule for computing the polynomials (tt-) for any ii > . . . > 

ik > 0. First, let bi{n) be the coefficient of in 

|i±^ = boin) + hin)-h+--- + b,in)-h'' + --. 

The coefficient bi — bi{n) is a polynomial of degree i in the unknown n, namely, 

6n = 1 , bi — n , bo = —T? , b-\ — —r? H n , 64 = — H r? , (6.4) 

u , i ' ^ 2 6 12 24 12 ^ ' 

111 1 1 23 

65 = H H n , ha = n*^ H n* H , . . . (6.5) 

^ 120 24 80 ' 720 72 1440 ' ^ ^ 

We next set b^i Q-j{n) — bi{n) and 

j 

6(ij)(n) = 6i(7i) • 6j(?i) - 2 • ^ (-1)''"^ • bi+k{n) ■ bj^k{n). 

k=l 

The general formula is now given by distinguishing three cases: 

^(»i,...,ifc)(?^) = Pfaff(6(i^,ij))s<t if k is even, 

b{n,...,ik){n) = ^(ii,...,ifc,o)(") if is odd and ik > 0, 

..,ifc_i,o)(") = ^(ii,...,jfc_i)(") if is odd and ik = 0. 

Proof of Theorem[Tl\ (2), (3) All seven formulas are gotten by specializing the formula in 
Theorem [T9l Let us begin with the first one where to = 3 and r — n — 2. Here D^~^ is 
0-dimensional and both the Euler number and the dual degree computes the number of 
points in D^~^ . The formula says 

<5(3,n,n-2) = ^.^-1)^^^ ■ {{Xi + ■ b^^,+2M+i)i^) ' {^q ^ ■ 

The only partition (Ai, A2) in the sum is (0, 0). Hence 5(3, n,n — 2) equals 

2n^ + n fn + l 



((l,0)).6(2,i)(n) = b,{n)b,{n)-2b,{n) = n- - 2^^ = 



3 



Next consider the case to = 4 and r = n — 2. Here ^ has dimension d ~ 1, and the sum 
in our formula is over the two partitions A — (0, 0) and A = (1, 0): 

5(4,n,n-2)= - 3 • ((1, 0)) • 6(2,i)(n) + ((2, 0)) • 6(3,1) (n) 

- 3 • (62(^)61 (n) - 263(n)) + 3 • (&3(^)&i(") - 2fe4(n)) 
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If we substitute (|6.4p into this expression, then we obtain 6("^^) as desired. 

The other five cases are similar. We derive only one more: for m = 8 and r = n — 3, the 
sum is over four partitions (0,0,0), (1,0,0), (2,0,0) and (1,1,0): 

Si8,n,n-3) = 21 • ((2, 1, 0)) • &32i(n) - 7 • ((3, 1, 0)) • 642i(n) 

+ l-((4,l,0))-&52i(ri) + 1- ((3,2,0)) -6431(^1). 

After applying the Pfaflian formulas 

hjk = ■ bko - hkbjo + b.kho and ((z, j, k)) = 2\{j, k)) - k)) + 2'=((i, j)), 

we substitute (I6.4p - (|6.5p into this expression and obtain the desired result. □ 

Example 20 Consider the four special entries of Table\^ which are listed in |/^.^[ ). Using 
the duality of Proposition we rewrite these values as 

(5(m,6,2) = (5(21 -TO, 6, 4) for to = 15, 14, 13, 12. 

The last three cases satisfy our hypothesis 10 = (^~2^^) < 21 — to < (^ 2^^) = 15, so 
Theorem I JffI applies and furnishes an independent proof of the correctness of these values in 
i4-4\ l- The only remaining entry in Table\^is 5(6,6,4) — 1400. The formula in Theorem [7^ 
correctly predicts that value, too. □ 

We conjecture that the formula of Theorem [12] holds in the general singular case. The 
formula does indeed make sense without the smoothness assumption, and it does give the 
correct number in all cases that we have checked. Experts in symmetric function theory might 
find it an interesting challenge to verify that the conjectured formula actually satisfies the 
duality relation (|4.2p . 

Conjecture 21 The formula for the algebraic degree of semidefinite programming in The- 
orem [TPI holds without the restriction in the range of m. 
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